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ABSTRACT 

The authors of the article describe the popular trends and methods as well as ICT tools used for the mapping and 
visualization of scientific domains as a research methodology which is attracting more and more interest from scientific 
information and science studies professionals. The researchers analysed Pajek, one of the programs used for the 
processing and visualization of bibliographic and bibliometric data, within the framework of the implementation of IRNet 
research network project and activities, and presented several examples of visualisation. 


KEYWORDS 

Networking, Bibliographic and Bibliometric Data, Mapping, Visualization, Applications 


1. INTRODUCTION 

Modern science is undergoing enormous changes and transformations. Science 2.0 is a term used to refer to 
processes, trends and phenomena related to the use of new technologies and information and communication 
technologies in science, and in particular, tools, services, publications and online resources. This trend 
stresses the benefits of increased collaboration and cooperation between researchers and scientists. However, 
it refers primarily to the consequences of using these technologies. 

At this stage of Web 2.0 development a popular trend has emerged whereby knowledge resources are 
mapped from public web services. For example, a map of English Wikipedia articles was generated using a 
measurement registering common categories of articles (Holloway, Bozicevic, Borner, 2007). User activity 
specialist services also inspire analysis and mapping of data such as blogs and forums, users’ logs (Bollen, 
Van de Sompel, Hagberg, Bettencourt, Chute, Rodriguez, Balakireva, 2009) etc. The concept of the mapping 
and visualization of scientific domains as a research methodology is presented in Figure 1. 

Scientometric and bibliometric approaches are being increasingly used by some authors to assess the 
evolution and structure of scientific knowledge and R&D output (e.g., Meyer, M. (2004), Wagner, C. S., and 
Leydesdorff, L. (2005), Dietz, J. S., and Bozeman, B. (2005), Adams, J. D. (2006), Hussler, Caroline, and 
Ronde, Patrick (2007)). Normally, studies within this research field (Meyer, M. (2000b), Meyer, M. (2004), 
Wagner, C. S., and Leydesdorff, L. (2005)) aim to appraise the scientific output of individuals, journals and 
even organizations (e.g., effective publication in internationally refereed journals, high citation scores) by 
surveying and analyzing co-authorships and citation indexes. 


2. SCIENCE MAPPING OR BIBLIOMETRIC MAPPING 

The existing methods of information visualization have been successfully adapted in a network environment 
where websites, with some approximation, can be regarded as scientific articles, and hyperlinks as quote 
links. This approach has promoted the rapid development of Webometrics (Zhao, Strotmann, 2008), Osinska, 
2010 ). 
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Science mapping or bibliometric 

mapping 


Science mapping or 
bibliometric mapping is a 

spatial representation of how 
disciplines, fields, specialities, 
and individual documents or 
authors are related to one 
another 


(a) data sources, 

(b) units of analysis, 

(d) similarity measures that can be used to normalize 
the relations between the units of analysis, 

(e) mapping steps, 

(f) types of methods of analysis that can be employed, 

(g) some visualization techniques, and finally, 

(h) interpretation of results 


Different approaches have 
been developed to extract 
networks using the selected 
units of analysis (authors, 
documents, journals, and 
terms). 

Co- author 

Co-word 

Co-citation 


Figure 1. Science Mapping Concept 

According to (Wagner, Leydesdorff, 2005), authors within this research field are interested in the increase 
of the interconnectedness of scientists (e.g., Okubo, Miquel, Frigoletto, and Dore, 1992), Luukkonen, 
Tijssen, Persson, Sivertsen, 1993; Zitt, Bassecoulard, Okubo, 2000; Glanzel, 2001; Cantner, Graf, 2006), in 
figuring out patterns of collaboration in general (e.g., Chung, Cox, 1990, Gibbons, Limoges, Nowotny, 1994; 
Katz, Martin, 1997; Dietz, Bozeman, (2005), Hussler, Ronde, 2007, and of international linkages in particular 
(e.g., Stichweh 1996, Schott, 1998) and further analyzing implications of linkages for funding and outcomes 
(e.g. Van Den Berghe, Houben, De Bruin, Moed, Kint, Luwel, Spruyt, 1998; Wagner, Yezril, Hassell, 2000; 
Sequeira, Pacheco, Teixeira: p. 8-16). 

To better understand the structure and dynamics of the development of individual science departments 
and to find a way to identify thematic trends, researchers analyze literature and the paths to cite individual 
publications. Analytical citation has been used in bibliometrics for a long time. The term developed by 
KDViz (Knowledge Domain Visualization) was coined by Ch. Chena in 2001, editor of the leading 
information visualization magazine Information Vizualizatiorf (common abbreviation: InfoViz). Because 
these methods lead to the generation of graphical maps, another name is used in parallel: Mapping Science 
or, less often, scientography. This last one was introduced in 1960 by E. Garfield, founder of the Institute for 
Information Science in Philadelphia (ISI). Over the past 10 years, the domain knowledge visualization 
research has been extended to include information retrieval tasks (Osinska, 2010). 


3. SCIENCE MAPPING 

Science mapping or bibliometric mapping is a spatial representation of how disciplines, fields, specialties, 
and individual documents or authors are related to one another (Small, 1999). It is focused on monitoring a 
scientific field and delimiting research areas to determine its cognitive structure and its evolution (Noyons, 
Moed & Van Raan, 1999). There are different important aspects to a science mapping analysis, such as: (a) 
data sources, (b) units of analysis, (c) data pre-processing, (d) similarity measures that can be used to 
normalize the relations between units of analysis, (e) mapping steps, (f) types of methods of analysis that 
can be employed, (g) some visualization techniques, and finally, (h) interpretation of results. 

Different approaches have been developed to extract networks using selected units of analysis (authors, 
documents, journals, and terms). Co-word analysis (Callon, Courtial, Turner & Bauin, 1983) uses the most 
important words or keywords of the documents to study the conceptual structure of a research field. 
Co-author analyzes authors and their affiliations to study the social structure and collaboration networks 
(Ganzel, 2001); Peters & Van Raan, 1991). 

Finally, the cited references are used to analyze the intellectual base used by the research field or to 
analyze documents that cite the same references. In this sense, bibliographic coupling (Kessler, 1963) 
analyzes citing documents, whereas co-citation analysis (Small, 1973) studies cited documents. Other 
approaches such as author bibliographic coupling (Zhao & Strotmann, 2008), author co-citation White & 
Griffith, 1981), journal bibliographic coupling (Gao & Guan, 2009; Small & Koenig, 1977), and journal 
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co-citation (Mccain, 1991) are examples of macro analysis using aggregated data (Cobo, Lopez-Herrera, 
Herr era-Viedma, Herrera, 2011: 1382). 

Interesting research results were received by Spanish authors (Gomez, Ignacio; Vazquez-Cano, Esteban; 
Lopez-Meneses, Eloy, 2016). According to their research, Spain was in 2013 the leading European country in 
MOOC course offerings and is in a leading position worldwide in the number of massive courses offered in 
2014. This prolific activity is being transferred to the educational and scientific world in the form of posts in 
blogs, social networks and web pages, as well as scientific papers and books that attempt to analyze the 
movement from different methodological approaches. To date there is not any research that analyzes the 
bibliometric impact of MOOC movement in the Spanish scientific community. Therefore the objective of this 
research is to perform a bibliometric study of the scientific impact in the form of scientific article or research 
book in journals and Spanish publishers from January 2010 to June 2014. The study was approached from a 
descriptive and quantitative methodology, taking as reference bibliometric indicators of production, number 
of citations, and indicators of visibility according to their impact on different databases: WoS / Social 
Science Citation Index, Scopus, In-Recs, Google Scholar and categorization of Spanish scientific journals 
(ANEP / FECYT). The results show that the impact of the Spanish scientific production in the form of books 
and scientific articles in prestigious international databases (WoS-SSCI / Scopus) is very low, although the 
national impact categorization according to ANEP / FECYT and In-Recs is moderately high (Gomez, 
Ignacio; Vazquez-Cano, Esteban; Lopez-Meneses, Eloy, 2016). 

Mapping and visualization on the TimeLine is comprehensively described and presented in Vieslava 
Osinska (2016). 

The following software products are used for processing of bibliographic and bibliometric data: 

• CiteSpace, CiteSpace II 

• Bibexcel 

• Pajek 

• VOSViewer 

• HistCite 

• XLSTAT 

• Publish or Perish 

• VantagePoint 

• Sci2 Tool 

• UCINET 

Applications used for a typical statistical analysis of large data sets include Statistica, Origin, Matlab, 
Mathematica. In the list above, special note should be made of CiteSpace - a continuously developed 
complete tool supporting all stages of the visualization process from data extraction to validation of the 
resulting graphic configuration. 


4. MAPPING AND VISUALIZATION OF IRNET PROJECT 
NETWORKING ACTIVITIES 

Within the framework of the IRNet project (www.irnet.us.edu.pl ) research is being conducted in several 
WPs which are separate, yet simultaneously connected through interrelated stages, which roughly address our 
several research questions.. A bibliographic database was created containing all the published articles, books 
from 2014 to date by all IRNet researchers (178 publications) according to the following rubrics: names and 
surnames of (co)authors, affiliation, title of publication, type (article, chapter, book, etc.), where published 
(conference proceedings, monograph, journal, etc.), year key words, indexing. 

This database also includes information regarding the number of authors, authors' affiliation, country, 
year, research areas, keywords, the source of publication (e.g. journal, book, etc.) and certain other data. 
Consequently, this dataset enables us to assess the main trends in IRNet scientific publications production. 
The time frame of the analysis is the last 4 years of IRNet existence, in which we have been able to trace its 
knowledge production and dissemination. 

Based on the dynamics of international co-authorships, we will be able to map and trace international 
collaboration patterns and thus infer IRNet geographical influence scope, i.e. its international influence 
(Research Question). 

By means of additional research utilizing information available in the Institute for Scientific Information 
(ISI), namely in the Science Citation Index (SCI), we assess the geographical pattern of the citations of IRNet 
scientific activities, publications, production. This enables us to evaluate to what extent IRNet project 
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scientific production has been cited at the European and world level. 

As the subject matter of the project is related to educational technologies or, to be more precise, to 
e-learning technologies, which is reflected in the participants’ publications, one can assume that an analysis 
of international collaboration and the degree of participants’ involvement in various countries etc., will 
indirectly allow for an evaluation of the objectives of e-leaming technology development in each partner 
country and institution. 


5. METHODOLOGY OF RESEARCH 

Among the main indicators of research effectiveness - bibliometric indicators - is a powerful information tool 
to support the development of science. Practical research methods were prepared using the programs: Bibexel 
(metadata analyses), and program Pajek: graph editing and visualization of the graph structure of 
co-authorship - using the method of Kamada-Kawai (Tomihisa Kamada & Satoru Kawai). 

For the study of science as a process of scientific communication, which is relevant to this study, one can 
use the tools of network analysis. As a tool for analysis and visualization of large scale networks we can 
recommend Pajek (Pajek, 1996). However, for complex types of analysis, one must have competencies 
helping one to work with specialized products and to develop programming skills. 

To carry out a network analysis of participants of the project, we will build a network of co-authorship for 
research activity and presentation of results in scientific publications. 

Analyses of data for the purpose of network analysis were performed using Bibexel, starting with 
obtaining initial data for this program. These data (up to the end of 2016) were assembled using built-in tools 
of the scientometric database Web of Science. Data for each participant in the project IRNet are files with the 
extension .txt which were combined using the program Bibexel: in the new window “Type new/file name 
here”, one should enter name which needs to have the merged file, select all the files one wants to merge, and 
run Files -> Append all selected files to another. 

For building a graphics editing program Pajek (http://mrvar.fdv.unilj.si/pajek//) the following objects 
were used: 

1. Networks are the main objects (vertices and lines), and the default extension is: .net. In this case, the 
created file is Analiz.net of the 83 vertices - authors’ articles and links between them, reflecting the 
co-authorship in publications. The file specifies an additional property of vertices - the identity of 
the authors who analyzed the participants of the project (diamond). 

*Vertices 83 

1. 1 "Kommers P." diamond 

2. 2 "Smyrnova-Trybulska E." diamond 

3. 3 "Morze N." diamond 

4. 4 "Noskova T." diamond 

5. 5 "Pavlova T." diamond 

6. 6 "Yakovleva O." diamond 

7. 7 "Sekret I." diamond 

8. 8 "Zavgorodnyi V." diamond 

9. 9 "YalovaK." diamond 

10. 10 "Yashina K." Triangle 

*Edges 
1 2 10 
1 3 8 

2. Vectors - file Analiz.vec, contains 83 entries, where for each vertex (author) its quantitative 

characteristic (real number) - the number of publications is indicated. *Vertices 83 

1 12 2 58 3 53 4 27 5 30 

3. Partitions - file Analiz.clu, where for each vertex the class is defined to which it belongs. In this case, 
there are 10 classes, in line with the countries’ mission project participants: 

1- Australia, 2- Czech Republic, 3 - Netherlands, 4 - Poland, 5- Portugal, 6 - Russia, 7- Slovakia,8- 
Spain, 9 -Turkey, 10 - Ukraine. 

*Vertices 83 

3 4 10 6 6 6 
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To build a network visualization of co-authorship in the program Pajek, it is necessary to download the 
file Analiz.net in the Networks field. Vectors - Analiz.vec Partitions - file Analiz.clu, and select Draw -> 
Network + First Partition+ First vector. 

As for building the visualization of networks linking scientists with special software and settings, the 
project participants’ "labeled" rectangles (diamond) and the corresponding vertices have the signatures - the 
names of the scientists, not the participants - triangles (Triangle); shapes-marks are of different colors 
depending on the country, which represent the authors, and the size that is determined by the number of each 
author’s publications. As can be seen in Figure 2, the scientists have different publication activity, some who 
are not the participants of the project have a greater number of publications than some of the participants. 
The latter can be considered as a basis for widening the circle of participants or an invitation of active 
scientists to other projects. The window where in the text file the relationships of a particular author 
(co-author) is described, can be called using the right mouse button for the selected node. 

As a result of the visualization of the graph structure of co-authorship we have obtained a Kamada-Kawai 
layout (Tomihisa Kamada & Satoru Kawai) (Figure 2). 



Figure 2. Example of Visualization of Co-Authorship with the Countries’ Mission of Project Participants 

Kamada-Kawai Layout 

This made it possible to analyze the contribution (number of publications) of the project participants and 
their collaborators within each country. For example, the largest representation of scientists are from 
Ukraine, representatives from Russia (3 participants) did not attract other sponsors from their country, but 
their activity was high, and there are countries, represented by one party such as Turkey (Sekret I.) and the 
Netherlands (Kommers P.), but the representative from the Netherlands cooperated with other project 
participants actively. 

However, it should be noted that visualization network analyses are often unreadable because they 
contain a large number of nodes with one or two connections. 

The concentration of connections allows for visually accentuating the centers of the analyzed activity, but 
it complicates general perception and makes the label unreadable. To view maps based on the results of the 
network analysis, it is recommended that the concentration of connections be reduced. Therefore, by means 
of the program Pajek a subnet was created consisting only of participants of the project 
(Operations/Network+Partition/Extract SubNetwork). When one starts this, first it is necessary to create a file 
- Part.cls where the elements of the desired cluster should be grouped. In this case it is 42 participants. Such 
a file can be generated automatically, if necessary, changes can be made manually. 

For a network that contains only the data about participants of the project, files and partition vectors are 
formed. You can optionally save this data as separate files or a network in general - with the file extension 
.paj (file/ Pajek project file /save). 

As a result of visualization of the graph structure whose vertices became only the authors of the project, 
we obtain a Kamada-Kawai layout in which the data are not grouped with representation (Figure 3). 
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Figure 3. Example - The Visualizing of the Graph Structure of Co-Authorship of the Project Participants 

Kamada-Kawai Layout 


As can be seen from Figure 3, the distribution of the contribution of the project participants by the 
number of posts is not uniform: the center is represented by more active members. For each network node the 
author's name and the number of publications are added (which also affects the size of the geometric area). 
The color is also allocated to the country offices participating in the project. 

It should be noted that the visualization, basically, is a brief synthesis of the results of the analysis that 
allows us to understand the context of the transition to the data that underpin it. The credibility presented in 
the form of a visualization of the analysis results will be high, if there is a verification of the results of the 
analysis, that is, all the visualization elements retrieved can be traced back to the primary data and methods 
of their processing. 

The results of this visualization can be used to make optimal decisions on the management of information 
resources with the aim of improving processes of scientific communication and the evaluation publications 
contribution of each participant. To determine the scientific impact of the project participants in the 
development of pedagogical science an additional analysis needs to be conducted. 


6. CONCLUSION 

This publication is devoted to the one of the popular trends referred to as mapping of scientific domains and 
visualization of research results in international networks. It can be considered as a research methodology 
which is gaining more and more popularity among scientific information professionals and specialists from 
different disciplines. 

There is an extensive offering of various types of software for processing of bibliometric data - open 
source license as well as commercial license programs. 

The authors of the article, who are researchers of the European IRNet project and international research 
network, described and analysed certain bibliographic results of these activities using methods of mapping 
and visualization of scientific domains. 

The methods for visual representation of scientometric information can serve as a basis for formulating 
preliminary working hypotheses in scientometric data analyses and for presenting the final results of 
analyses, in this case being a statistical analysis (a scholar’s profile), a geospatial analysis (of member states) 
and a network analysis (participation in the project). Scientometric information mapping is closely related to 
the task of preparing forecasts of science development and improving the quality of existing forms of 
collaboration as well as developing new ones. The results obtained give grounds for supposing that the 
development of science branches and disciplines depends, to a large extent, on the existence and 
development of human capital and collaboration forms (we examined collaboration on this project only, but 
collaboration as such is limited to that) and, to a lesser extent, on identifying prospective fields of study and 
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areas of technology. Further research will be pursued, among other things, to design a thematic-based 
collaboration model (constructing scientific e-communication models) and to assess the effect on scientists 
(and individual scientists) and science in general. 
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